Overview
Brought to you by YData
Dataset statistics
| Number of variables | 10 |
|---|---|
| Number of observations | 1384617 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 105.6 MiB |
| Average record size in memory | 80.0 B |
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 2 |
eval_set has constant value "train" | Constant |
order_dow has 324026 (23.4%) zeros | Zeros |
days_since_prior_order has 17044 (1.2%) zeros | Zeros |
Reproduction
| Analysis started | 2024-11-01 21:07:35.426262 |
|---|---|
| Analysis finished | 2024-11-01 21:08:03.001191 |
| Duration | 27.57 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Variables
order_id
Real number (ℝ)
| Distinct | 131209 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1706297.6 |
| Minimum | 1 |
|---|---|
| Maximum | 3421070 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 170761 |
| Q1 | 843370 |
| median | 1701880 |
| Q3 | 2568023 |
| 95-th percentile | 3249514.2 |
| Maximum | 3421070 |
| Range | 3421069 |
| Interquartile range (IQR) | 1724653 |
Descriptive statistics
| Standard deviation | 989732.65 |
|---|---|
| Coefficient of variation (CV) | 0.5800469 |
| Kurtosis | -1.2066256 |
| Mean | 1706297.6 |
| Median Absolute Deviation (MAD) | 861914 |
| Skewness | 0.0063315648 |
| Sum | 2.3625687 × 1012 |
| Variance | 9.7957072 × 1011 |
| Monotonicity | Increasing |
| Value | Count | Frequency (%) |
| 1395075 | 80 | < 0.1% |
| 2813632 | 80 | < 0.1% |
| 949182 | 77 | < 0.1% |
| 2869702 | 76 | < 0.1% |
| 341238 | 76 | < 0.1% |
| 312611 | 75 | < 0.1% |
| 1465173 | 74 | < 0.1% |
| 1355077 | 74 | < 0.1% |
| 653280 | 72 | < 0.1% |
| 288915 | 72 | < 0.1% |
| Other values (131199) | 1383861 |
| Value | Count | Frequency (%) |
| 1 | 8 | < 0.1% |
| 36 | 8 | < 0.1% |
| 38 | 9 | < 0.1% |
| 96 | 7 | < 0.1% |
| 98 | 49 | |
| 112 | 11 | < 0.1% |
| 170 | 17 | < 0.1% |
| 218 | 5 | < 0.1% |
| 226 | 13 | < 0.1% |
| 349 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 3421070 | 3 | < 0.1% |
| 3421063 | 4 | < 0.1% |
| 3421058 | 8 | < 0.1% |
| 3421056 | 5 | < 0.1% |
| 3421049 | 6 | < 0.1% |
| 3421026 | 6 | < 0.1% |
| 3420998 | 28 | |
| 3420996 | 11 | < 0.1% |
| 3420979 | 6 | < 0.1% |
| 3420909 | 10 | < 0.1% |
product_id
Real number (ℝ)
| Distinct | 39123 |
|---|---|
| Distinct (%) | 2.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25556.236 |
| Minimum | 1 |
|---|---|
| Maximum | 49688 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3397 |
| Q1 | 13380 |
| median | 25298 |
| Q3 | 37940 |
| 95-th percentile | 47601 |
| Maximum | 49688 |
| Range | 49687 |
| Interquartile range (IQR) | 24560 |
Descriptive statistics
| Standard deviation | 14121.272 |
|---|---|
| Coefficient of variation (CV) | 0.55255682 |
| Kurtosis | -1.1537944 |
| Mean | 25556.236 |
| Median Absolute Deviation (MAD) | 12122 |
| Skewness | -0.022354791 |
| Sum | 3.5385598 × 1010 |
| Variance | 1.9941034 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24852 | 18726 | 1.4% |
| 13176 | 15480 | 1.1% |
| 21137 | 10894 | 0.8% |
| 21903 | 9784 | 0.7% |
| 47626 | 8135 | 0.6% |
| 47766 | 7409 | 0.5% |
| 47209 | 7293 | 0.5% |
| 16797 | 6494 | 0.5% |
| 26209 | 6033 | 0.4% |
| 27966 | 5546 | 0.4% |
| Other values (39113) | 1288823 |
| Value | Count | Frequency (%) |
| 1 | 76 | |
| 2 | 4 | < 0.1% |
| 3 | 6 | < 0.1% |
| 4 | 22 | < 0.1% |
| 5 | 1 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 13 | < 0.1% |
| 9 | 5 | < 0.1% |
| 10 | 119 | |
| 11 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 49688 | 4 | < 0.1% |
| 49687 | 1 | < 0.1% |
| 49686 | 7 | < 0.1% |
| 49683 | 2413 | |
| 49682 | 5 | < 0.1% |
| 49681 | 8 | < 0.1% |
| 49680 | 46 | < 0.1% |
| 49679 | 4 | < 0.1% |
| 49678 | 21 | < 0.1% |
| 49677 | 8 | < 0.1% |
add_to_cart_order
Real number (ℝ)
| Distinct | 80 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.7580443 |
| Minimum | 1 |
|---|---|
| Maximum | 80 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 7 |
| Q3 | 12 |
| 95-th percentile | 23 |
| Maximum | 80 |
| Range | 79 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 7.4239365 |
|---|---|
| Coefficient of variation (CV) | 0.84767058 |
| Kurtosis | 4.1722265 |
| Mean | 8.7580443 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 1.6855488 |
| Sum | 12126537 |
| Variance | 55.114833 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 131209 | 9.5% |
| 2 | 124364 | 9.0% |
| 3 | 116996 | 8.4% |
| 4 | 108963 | 7.9% |
| 5 | 100745 | 7.3% |
| 6 | 91850 | 6.6% |
| 7 | 83142 | 6.0% |
| 8 | 74601 | 5.4% |
| 9 | 66618 | 4.8% |
| 10 | 59401 | 4.3% |
| Other values (70) | 426728 |
| Value | Count | Frequency (%) |
| 1 | 131209 | |
| 2 | 124364 | |
| 3 | 116996 | |
| 4 | 108963 | |
| 5 | 100745 | |
| 6 | 91850 | |
| 7 | 83142 | |
| 8 | 74601 | |
| 9 | 66618 | |
| 10 | 59401 |
| Value | Count | Frequency (%) |
| 80 | 2 | < 0.1% |
| 79 | 2 | < 0.1% |
| 78 | 2 | < 0.1% |
| 77 | 3 | < 0.1% |
| 76 | 5 | |
| 75 | 6 | |
| 74 | 8 | |
| 73 | 8 | |
| 72 | 10 | |
| 71 | 10 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 828824 | |
| 0 | 555793 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 828824 | |
| 0 | 555793 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 828824 | |
| 0 | 555793 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1384617 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 828824 | |
| 0 | 555793 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1384617 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 828824 | |
| 0 | 555793 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1384617 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 828824 | |
| 0 | 555793 |
user_id
Real number (ℝ)
| Distinct | 131209 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103112.78 |
| Minimum | 1 |
|---|---|
| Maximum | 206209 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 10425 |
| Q1 | 51732 |
| median | 102933 |
| Q3 | 154959 |
| 95-th percentile | 195696 |
| Maximum | 206209 |
| Range | 206208 |
| Interquartile range (IQR) | 103227 |
Descriptive statistics
| Standard deviation | 59487.148 |
|---|---|
| Coefficient of variation (CV) | 0.57691342 |
| Kurtosis | -1.2007212 |
| Mean | 103112.78 |
| Median Absolute Deviation (MAD) | 51608 |
| Skewness | -0.0003274701 |
| Sum | 1.4277171 × 1011 |
| Variance | 3.5387208 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 197541 | 80 | < 0.1% |
| 149753 | 80 | < 0.1% |
| 63458 | 77 | < 0.1% |
| 83993 | 76 | < 0.1% |
| 189951 | 76 | < 0.1% |
| 169647 | 75 | < 0.1% |
| 31611 | 74 | < 0.1% |
| 104741 | 74 | < 0.1% |
| 181991 | 72 | < 0.1% |
| 59321 | 72 | < 0.1% |
| Other values (131199) | 1383861 |
| Value | Count | Frequency (%) |
| 1 | 11 | < 0.1% |
| 2 | 31 | |
| 5 | 9 | < 0.1% |
| 7 | 9 | < 0.1% |
| 8 | 18 | |
| 9 | 22 | |
| 10 | 4 | < 0.1% |
| 13 | 5 | < 0.1% |
| 14 | 11 | < 0.1% |
| 17 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 206209 | 8 | < 0.1% |
| 206205 | 19 | |
| 206203 | 13 | |
| 206200 | 19 | |
| 206199 | 22 | |
| 206198 | 13 | |
| 206196 | 15 | |
| 206195 | 6 | < 0.1% |
| 206193 | 6 | < 0.1% |
| 206191 | 23 |
eval_set
Categorical
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 10.6 MiB |
| train |
|---|
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | train |
|---|---|
| 2nd row | train |
| 3rd row | train |
| 4th row | train |
| 5th row | train |
Common Values
| Value | Count | Frequency (%) |
| train | 1384617 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| train | 1384617 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1384617 | |
| r | 1384617 | |
| a | 1384617 | |
| i | 1384617 | |
| n | 1384617 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6923085 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 1384617 | |
| r | 1384617 | |
| a | 1384617 | |
| i | 1384617 | |
| n | 1384617 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6923085 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 1384617 | |
| r | 1384617 | |
| a | 1384617 | |
| i | 1384617 | |
| n | 1384617 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6923085 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 1384617 | |
| r | 1384617 | |
| a | 1384617 | |
| i | 1384617 | |
| n | 1384617 |
order_number
Real number (ℝ)
| Distinct | 97 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.09141 |
| Minimum | 4 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 MiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 6 |
| median | 11 |
| Q3 | 21 |
| 95-th percentile | 52 |
| Maximum | 100 |
| Range | 96 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 16.614037 |
|---|---|
| Coefficient of variation (CV) | 0.97206939 |
| Kurtosis | 5.8967139 |
| Mean | 17.09141 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 2.2433716 |
| Sum | 23665057 |
| Variance | 276.02621 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4 | 149882 | 10.8% |
| 5 | 123548 | 8.9% |
| 6 | 105328 | 7.6% |
| 7 | 90949 | 6.6% |
| 8 | 75645 | 5.5% |
| 9 | 68366 | 4.9% |
| 10 | 60216 | 4.3% |
| 11 | 51530 | 3.7% |
| 12 | 47819 | 3.5% |
| 13 | 42072 | 3.0% |
| Other values (87) | 569262 |
| Value | Count | Frequency (%) |
| 4 | 149882 | |
| 5 | 123548 | |
| 6 | 105328 | |
| 7 | 90949 | |
| 8 | 75645 | |
| 9 | 68366 | |
| 10 | 60216 | |
| 11 | 51530 | 3.7% |
| 12 | 47819 | 3.5% |
| 13 | 42072 | 3.0% |
| Value | Count | Frequency (%) |
| 100 | 7624 | |
| 99 | 250 | < 0.1% |
| 98 | 292 | < 0.1% |
| 97 | 324 | < 0.1% |
| 96 | 469 | < 0.1% |
| 95 | 373 | < 0.1% |
| 94 | 431 | < 0.1% |
| 93 | 358 | < 0.1% |
| 92 | 416 | < 0.1% |
| 91 | 370 | < 0.1% |
order_dow
Real number (ℝ)
Zeros 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7013918 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 324026 |
| Zeros (%) | 23.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.1676456 |
|---|---|
| Coefficient of variation (CV) | 0.80241809 |
| Kurtosis | -1.3989458 |
| Mean | 2.7013918 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.1755159 |
| Sum | 3740393 |
| Variance | 4.6986876 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 324026 | |
| 6 | 207279 | |
| 1 | 205978 | |
| 5 | 176910 | |
| 2 | 160562 | |
| 4 | 155481 | |
| 3 | 154381 |
| Value | Count | Frequency (%) |
| 0 | 324026 | |
| 1 | 205978 | |
| 2 | 160562 | |
| 3 | 154381 | |
| 4 | 155481 | |
| 5 | 176910 | |
| 6 | 207279 |
| Value | Count | Frequency (%) |
| 6 | 207279 | |
| 5 | 176910 | |
| 4 | 155481 | |
| 3 | 154381 | |
| 2 | 160562 | |
| 1 | 205978 | |
| 0 | 324026 |
order_hour_of_day
Real number (ℝ)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.577592 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 9083 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 10 |
| median | 14 |
| Q3 | 17 |
| 95-th percentile | 21 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 4.238458 |
|---|---|
| Coefficient of variation (CV) | 0.31216566 |
| Kurtosis | 0.043845727 |
| Mean | 13.577592 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.12102981 |
| Sum | 18799765 |
| Variance | 17.964526 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14 | 119370 | 8.6% |
| 15 | 116198 | 8.4% |
| 13 | 114762 | 8.3% |
| 11 | 114119 | 8.2% |
| 12 | 111752 | 8.1% |
| 10 | 110479 | 8.0% |
| 16 | 110237 | 8.0% |
| 17 | 96944 | 7.0% |
| 9 | 93856 | 6.8% |
| 18 | 76522 | 5.5% |
| Other values (14) | 320378 |
| Value | Count | Frequency (%) |
| 0 | 9083 | 0.7% |
| 1 | 5626 | 0.4% |
| 2 | 3226 | 0.2% |
| 3 | 2438 | 0.2% |
| 4 | 2431 | 0.2% |
| 5 | 3847 | 0.3% |
| 6 | 11847 | 0.9% |
| 7 | 36302 | 2.6% |
| 8 | 67386 | |
| 9 | 93856 |
| Value | Count | Frequency (%) |
| 23 | 16965 | 1.2% |
| 22 | 27319 | 2.0% |
| 21 | 34813 | 2.5% |
| 20 | 40920 | 3.0% |
| 19 | 58175 | |
| 18 | 76522 | |
| 17 | 96944 | |
| 16 | 110237 | |
| 15 | 116198 | |
| 14 | 119370 |
days_since_prior_order
Real number (ℝ)
Zeros 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.066126 |
| Minimum | 0 |
|---|---|
| Maximum | 30 |
| Zeros | 17044 |
| Zeros (%) | 1.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 10.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 7 |
| median | 15 |
| Q3 | 30 |
| 95-th percentile | 30 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 23 |
Descriptive statistics
| Standard deviation | 10.426418 |
|---|---|
| Coefficient of variation (CV) | 0.61094228 |
| Kurtosis | -1.5712889 |
| Mean | 17.066126 |
| Median Absolute Deviation (MAD) | 9 |
| Skewness | 0.074891246 |
| Sum | 23630048 |
| Variance | 108.71019 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30 | 407265 | |
| 7 | 106801 | 7.7% |
| 6 | 72138 | 5.2% |
| 8 | 61821 | 4.5% |
| 5 | 54117 | 3.9% |
| 14 | 51690 | 3.7% |
| 4 | 45727 | 3.3% |
| 9 | 43410 | 3.1% |
| 13 | 39081 | 2.8% |
| 3 | 36550 | 2.6% |
| Other values (21) | 466017 |
| Value | Count | Frequency (%) |
| 0 | 17044 | 1.2% |
| 1 | 19265 | 1.4% |
| 2 | 27504 | 2.0% |
| 3 | 36550 | 2.6% |
| 4 | 45727 | |
| 5 | 54117 | |
| 6 | 72138 | |
| 7 | 106801 | |
| 8 | 61821 | |
| 9 | 43410 |
| Value | Count | Frequency (%) |
| 30 | 407265 | |
| 29 | 15397 | 1.1% |
| 28 | 21223 | 1.5% |
| 27 | 15460 | 1.1% |
| 26 | 12500 | 0.9% |
| 25 | 14054 | 1.0% |
| 24 | 13947 | 1.0% |
| 23 | 15575 | 1.1% |
| 22 | 20457 | 1.5% |
| 21 | 29173 | 2.1% |
Interactions
Correlations
| add_to_cart_order | days_since_prior_order | order_dow | order_hour_of_day | order_id | order_number | product_id | reordered | user_id | |
|---|---|---|---|---|---|---|---|---|---|
| add_to_cart_order | 1.000 | 0.018 | -0.024 | -0.010 | 0.002 | 0.031 | 0.007 | 0.137 | -0.000 |
| days_since_prior_order | 0.018 | 1.000 | -0.025 | 0.008 | 0.003 | -0.387 | 0.001 | 0.166 | 0.004 |
| order_dow | -0.024 | -0.025 | 1.000 | 0.009 | 0.001 | 0.015 | -0.004 | 0.017 | -0.006 |
| order_hour_of_day | -0.010 | 0.008 | 0.009 | 1.000 | -0.003 | -0.035 | 0.002 | 0.034 | -0.001 |
| order_id | 0.002 | 0.003 | 0.001 | -0.003 | 1.000 | 0.002 | -0.001 | 0.004 | -0.001 |
| order_number | 0.031 | -0.387 | 0.015 | -0.035 | 0.002 | 1.000 | -0.001 | 0.237 | -0.005 |
| product_id | 0.007 | 0.001 | -0.004 | 0.002 | -0.001 | -0.001 | 1.000 | 0.042 | -0.001 |
| reordered | 0.137 | 0.166 | 0.017 | 0.034 | 0.004 | 0.237 | 0.042 | 1.000 | 0.006 |
| user_id | -0.000 | 0.004 | -0.006 | -0.001 | -0.001 | -0.005 | -0.001 | 0.006 | 1.000 |
Missing values
Sample
| order_id | product_id | add_to_cart_order | reordered | user_id | eval_set | order_number | order_dow | order_hour_of_day | days_since_prior_order | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 49302 | 1 | 1 | 112108 | train | 4 | 4 | 10 | 9.0 |
| 1 | 1 | 11109 | 2 | 1 | 112108 | train | 4 | 4 | 10 | 9.0 |
| 2 | 1 | 10246 | 3 | 0 | 112108 | train | 4 | 4 | 10 | 9.0 |
| 3 | 1 | 49683 | 4 | 0 | 112108 | train | 4 | 4 | 10 | 9.0 |
| 4 | 1 | 43633 | 5 | 1 | 112108 | train | 4 | 4 | 10 | 9.0 |
| 5 | 1 | 13176 | 6 | 0 | 112108 | train | 4 | 4 | 10 | 9.0 |
| 6 | 1 | 47209 | 7 | 0 | 112108 | train | 4 | 4 | 10 | 9.0 |
| 7 | 1 | 22035 | 8 | 1 | 112108 | train | 4 | 4 | 10 | 9.0 |
| 8 | 36 | 39612 | 1 | 0 | 79431 | train | 23 | 6 | 18 | 30.0 |
| 9 | 36 | 19660 | 2 | 1 | 79431 | train | 23 | 6 | 18 | 30.0 |
| order_id | product_id | add_to_cart_order | reordered | user_id | eval_set | order_number | order_dow | order_hour_of_day | days_since_prior_order | |
|---|---|---|---|---|---|---|---|---|---|---|
| 1384607 | 3421058 | 30316 | 6 | 1 | 136952 | train | 20 | 3 | 18 | 15.0 |
| 1384608 | 3421058 | 35578 | 7 | 0 | 136952 | train | 20 | 3 | 18 | 15.0 |
| 1384609 | 3421058 | 32650 | 8 | 1 | 136952 | train | 20 | 3 | 18 | 15.0 |
| 1384610 | 3421063 | 49235 | 1 | 1 | 169679 | train | 30 | 0 | 10 | 4.0 |
| 1384611 | 3421063 | 13565 | 2 | 1 | 169679 | train | 30 | 0 | 10 | 4.0 |
| 1384612 | 3421063 | 14233 | 3 | 1 | 169679 | train | 30 | 0 | 10 | 4.0 |
| 1384613 | 3421063 | 35548 | 4 | 1 | 169679 | train | 30 | 0 | 10 | 4.0 |
| 1384614 | 3421070 | 35951 | 1 | 1 | 139822 | train | 15 | 6 | 10 | 8.0 |
| 1384615 | 3421070 | 16953 | 2 | 1 | 139822 | train | 15 | 6 | 10 | 8.0 |
| 1384616 | 3421070 | 4724 | 3 | 1 | 139822 | train | 15 | 6 | 10 | 8.0 |